High quality multi-pulse based CELP speech coding at 6.4 kb/s and its subjective evaluation

Authors

  • Kazunori Ozawa
  • Masahiro Serizawa
Abstract

This paper proposes an MP-CELP (Multi-Pulse-based CELP) speech coder operating at 6.4 kb/s with a 10 ms frame. In MP-CELP, the amplitudes or signs of the multi-pulse excitation are simultaneously vector quantized (VQ). A combination search between multiple pulse location candidates and the VQ codebook remarkably improves the quantization performance. To improve speech quality under background noise conditions, an adaptive pulse location restriction method is developed. The subjective evaluation results show that the speech quality of 6.4 kb/s MP-CELP is higher than that of G.726 at 32 kb/s and is equivalent to that of 6.3 kb/s G.723.1 with a 30 ms frame in clean speech and tandem conditions. For background noise conditions, the adaptive pulse location restriction improves the MOS value by 0.9. The resulting speech quality is equivalent to that of G.723.1, but still does not reach that of 24 kb/s G.726, except in the interfering talker condition.
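The bit rates and frame lengths quoted in the abstract imply a per-frame bit budget, which a short calculation makes concrete. The following is a minimal sketch; the function name `bits_per_frame` is hypothetical and is not taken from the paper, and the abstract does not say how the bits are allocated among the coder parameters.

```python
def bits_per_frame(bit_rate_kbps: float, frame_ms: float) -> int:
    """Return the number of bits available in one frame.

    kb/s multiplied by ms gives bits directly, because the factor of
    1000 in the rate cancels the factor of 1/1000 in the duration.
    """
    return round(bit_rate_kbps * frame_ms)


# Figures quoted in the abstract.
mp_celp_bits = bits_per_frame(6.4, 10)   # 6.4 kb/s MP-CELP, 10 ms frame
g7231_bits = bits_per_frame(6.3, 30)     # 6.3 kb/s G.723.1, 30 ms frame

print(f"MP-CELP: {mp_celp_bits} bits per 10 ms frame")
print(f"G.723.1: {g7231_bits} bits per 30 ms frame")
```

At nearly the same bit rate, the 10 ms coder has to describe each frame with about a third of the bits that a 30 ms coder spends per frame, while updating its parameters three times as often.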

Similar resources

4 kb/s multi-pulse based CELP speech coding using excitation switching

This paper proposes an MP-CELP (Multi-Pulse-based CELP) speech coder operating at 4 kb/s. In MP-CELP, the amplitudes or signs of the multi-pulse excitation are simultaneously vector quantized (VQ). To improve speech quality under background noise conditions, the excitation signal is switched between voiced and unvoiced speech, and the number of pulses is greatly increased for unvoiced speech by restricting pul...

Hybrid MELP/CELP coding at bit rates from 6.4 to 2.4 kb/s

This paper describes extensions of the 4 kb/s hybrid MELP/CELP coder, up to 6.4 kb/s and down to 2.4 kb/s. The baseline 4 kb/s coder uses three coding modes: MELP in strongly voiced speech frames, CELP with pitch prediction in weakly voiced frames, and CELP with stochastic excitation in unvoiced frames. To minimize switching artifacts between parametric MELP and waveform CELP coding, an alignme...

Using Various Types of Excitation Signals

A high-quality speech coding method (SPMEX) at 4.8 kb/s is proposed. The SPMEX selects a suitable excitation signal, based on the decision from acoustic features of speech signal in a frame. Improved pitch interpolation multi-pulse (PMPC) excitation is selected for vowel-like speech. In PMPC, multi-pulse during only one pitch period is calculated in the frame. Further, gain and phase adjusting ...

Analysis by synthesis speech coding with generalized pitch prediction

A new analysis-by-synthesis speech coding structure is presented for high-quality speech coding in the 4 to 8 kb/s range. CELP with generalized pitch prediction (GPP-CELP) differs from classical code-excited linear prediction (CELP) in that for voiced segments it is the speech signal that is decomposed into a component predictable with the aid of the adaptive codebook (ACB) and a nonpredictable ...

High quality MELP coding at bit-rates around 4 kb/s

Recently, a number of coding techniques have been reported to achieve near toll quality synthesized speech at bit-rates around 4 kb/s. These include variants of Code Excited Linear Prediction (CELP), Sinusoidal Transform Coding (STC) and Multi-Band Excitation (MBE). While CELP has been an effective technique for bit-rates above 6 kb/s, STC, MBE, Waveform Interpolation (WI) and Mixed Excitation ...

Publication date: 1998